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Abstract 



Consider the finite regular language L„ = {wO \ w S 
{0, 1}*, \w\ < n}. In it was shown that while this lan- 
guage is accepted by a deterministic finite automaton of 
size 0{n), any one-way quantum finite automaton (QFA) 
for it has size 2^("/'°s"). This was based on the fact 
that the evolution of a QFA is required to be reversible. 
When arbitrary intermediate measurements are allowed, 
this intuition breaks down. Nonetheless, we show a 2^*^") 
lower bound for such QFA for Ln, thus also improving 
the previous bound. The improved bound is obtained from 
simple entropy arguments based on Holevo 's theorem /^. 
This method also allows us to obtain an asymptotically op- 
timal (1 — H{p))n bound for the dense quantum codes 
(random access codes) introduced in /^. We then turn 
to Holevo 's theorem, and show that in typical situations, 
it may be replaced by a tighter and more transparent in- 
probability bound. 



1. Introduction 

One-way quantum finite automata (QFA) were defined 
in [|ri], ^ and have drawn much interest since because they 
reflect the capabiHties of currently feasible experimental 
quantum computers. Moreover, their study provides much 
insight into the nature of quantum computation. Results like 
those of 1^ and [||] show that the laws underlying quan- 
tum computation are a mixed blessing. [|[ shows how one 
may use superpositions to design QFA for certain languages 
that are exponentially more succinct than the corresponding 
classical FA. In contrast, other results from show that the 
reversibility requirements of quantum mechanics imposes 
serious limits on the power of QFA — they show that QFA 

*This work was initiated at tlie 1998 Elsag-Bailey - I.S.I. Foundation 
research meeting on quantum computation. 
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for certain other languages are exponentially larger than the 
corresponding DFA. In this paper we consider a different 
model of QFA (called enhanced QFA) where the state of 
the QFA can be measured while each symbol is processed. 
In the case of more general models such as quantum Turing 
machines such intermediate measurements do not increase 
the power of the model, since measurements can always be 
replaced by safe storage. However, in the case of QFA, the 
space limitations inherent in the definition preclude the pos- 
sibility of similar reasoning. Moreover, in this new model, 
the evolution of the system is no longer reversible, so the 
intuition from Ij] no longer applies. Indeed, this new 
model of QFA was suggested by Dorit Aharonov as a more 
physically appropriate model that might not suffer from un- 
necessary handicaps resulting from the reversibility prop- 
erty embedded in the definitions from [ pi] , ^ . 

In this paper, we show that enhanced QFA are also ex- 
ponentially larger than the corresponding DFA for certain 
languages. The conceptual framework for our proof is com- 
pletely different from that in [|[] . We consider the evolution 
of a QFA on a random input string and show that the en- 
tropy of the mixed state that it exists in can only increase 
with each successive symbol read. This holds true even in 
the presence of intermediate measurements. Moreover, for 
certain languages, it is possible to bound from below the 
increase in entropy that results from processing each sym- 
bol, by appealing to Holevo's theorem [||]. Finally, we can 
bound the total information capacity of the QFA in terms 
of the number of states of the QFA, and therefore obtain a 
lower bound on the number of states required to correctly 
recognize strings of the language. The new bound we get is 
tight, and therefore answers an issue left open in [|[. 

The paper [||] also introduced the novel possibility of 
dense quantum codes that seem to violate Holevo's bound 
by exploiting the fact that in general measurements do not 
commute. This raised the possibility of (for instance) parsi- 
moniously encoding an entire telephone directory such that 
any single number could be extracted from it via a suitable 
measurement. Examples of such random access codes were 
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given in [|]] that have no classical counterparts. However, 
it was also shown that no more than a logarithmic factor 
compression is achievable. We can use the same conceptual 
framework as described above to give a linear bound on the 
number of qubits required for such codes. This bound is op- 
timal up to an additive logarithmic term, as follows from the 
classical upper bound given in [|[|. Thus, quantum encod- 
ing offers no asymptotic advantage over classical encoding 
in this scenario. This resolves an open question from [|^]. 

Finally we turn our attention to Holevo's bound itself. 
Typically in quantum computation applications (though not 
in this paper), Holevo's bound is applied by converting, of- 
ten implicitly, a statement about the probability of correct 
decoding into a statement in terms of entropy, when a ran- 
dom variable X is transmitted over a quantum channel us- 
ing TO quantum bits. We give a tight bound on this decoding 
probability by a direct argument which allows us to infer 
lower bounds for m without resorting to Holevo's theorem. 
Since the probability bound is tight, the inferred bounds 
are at least as good as those implied by Holevo's theorem. 
We also provide an example where it gives a strictly bet- 
ter bound than the latter We should mention that the proof 
of Holevo's bound (which is essentially equivalent to the 
strong subadditivity property of von Neumann entropy) is 
rather involved, while the proof of the probabiUty bounds is 
quite transparent. 

2. Summary of results 

A QFA (as defined in [^) differs from a DFA in that 
its state is in general a superposition of the classical (basis) 
states. It starts in such a state, and when a new input sym- 
bol a is seen, a corresponding unitary operator Ua is applied 
to it. The state is then measured to check for acceptance, re- 
jection or continuation. If the result of the measurement is 
'continue,' the next symbol is read, otherwise the input is 
accepted or rejected. A QFA recognizes a language if all 
the strings in it (or not in it) are accepted (respectively, re- 
jected) with constant probability bounded away from 1/2. 
See Section 3.2 for a more precise definition of QFA. 



We start by showing an exponential lower bound for 
QFA. 

Theorem 2.1 Let i„ be the language 



{wQ\w e {0,1}*, \w\ < n}. 



Then, 



1. Ln is recognized by a DFA of size 0{n), 

2. Ln is recognized by some QFA, and 

3. Any QFA recognizing L„ with some constant probabil- 
ity greater than ^ has 2^'"^ states. 



Note that a 2^^"^ versus 0{n) separation is the best possi- 
ble if only finite languages (or regular languages with suf- 
ficiently high probability of acceptance by a QFA) are con- 
sidered: such languages are recognized by reversible (deter- 
ministic) FA that are at most exponentially larger than the 
corresponding DFA [^] . 

We then consider enhanced QFA, in which instead of 
only applying a unitary transformation when a new input 
symbol is seen, we allow a combination of unitary opera- 
tors and orthogonal measurements. With the introduction 
of irreversibility via measurements, it may appear that such 
automata be at least as powerful as DFA. However, it is not 
hard to verify (by applying a technique of [jl^, also used 
in [^) that enhanced QFA accept only regular languages. 



Moreover, we show that the bound of Theorem p. 1 
ues to hold. 



contm- 



Theorem 2.2 The statements of Theorem 2.1 hold also for 
enhanced QFA. 

It also follows from the proof of this theorem that enhanced 
QFA accept only a strict subset of the regular languages. 

Random access encoding was introduced in ||^] as a po- 
tentially powerful primitive in quantum information pro- 
cessing. An (n,m,p)- random access encoding is a func- 
tion / that maps n-bit strings to mixed states over to qubits 
such that, for every i G {1, . . . , n}, there is a measure- 
ment Oi with outcome or 1 that has the property that for 
all X e {0, 1}", 

Serial encoding was defined similarly, except that the mea- 
surement Oi is allowed to depend on all the subsequent 
bits Xi+i ■ ■ ■ Xn of the encoded string. The technique used 
in proving Theorem 2.1 also yields a bound for such en- 
coding. This bound matches the classical upper bound 
of (1 — H{p))n + O(logn) shown in up to the loga- 
rithmic additive term. 

Theorem 2.3 Any {n,'m,p)-random access (or serial) en- 
coding has TO > (1 — H{p))n. 

To finish, we present a simple alternative to Holevo's 
bound 

Theorem 2.4 Let X be a random variable over bit strings 
which are encoded as mixed states over to qubits and 
let P{X, d) denote the net probability of the d most likely 
strings in the sample space of X. IfY is any random vari- 
able obtained by making some measurement of the encoding 
of X, then 

L there is a decoding procedure T^q such that 

Pt[Vo{Y)^X] > 2-"^^\^\ 

where H{X\Y) is the conditional Shannon entropy 
of X with respect to Y; and 



2. for any decoding function V, 

Vt:[V{Y)=X] < P(X,2™). 

In particular, this implies that when X is distributed uni- 
formly, the mutual information I{X -.Y) of X and Y is at 
most m. Typical applications of the Holevo's bound such 
as that in [ |l0| , |[| involve only this weaker form. Our bound 
thus obviates the need for a translation of in-probability 
statements into statements about mutual information in 
these cases, also giving better bounds than Holevo's theo- 
rem in the process. 

3. Preliminaries 



First, in Section 3.1, we review the basic elements of 



quantum information theory. Then, in Section [3.2| , we de- 
fine enhanced QFA formally using some of the concepts 
presented there. 

3.1. Information theory basics 

We use the following notation in this paper Let X and Y 
be two random variables. H{X) denotes the Shannon en- 
tropy of X; H{X\Y), the conditional Shannon entropy 
of X with respect to the variable Y; and I{X : Y), the 
mutual information of the two variables X and Y . We also 
use H : [0, 1] — > [0, 1] to denote the binary entropy func- 
tion. We refer the reader to |Q| for the definition and prop- 
erties of these standard concepts from classical information 
theory. 

The quantum mechanical analogue of a random variable 
is a probability distribution over superpositions, also called 
a mixed state. Consider the mixed state {pi, \4>i)}, where 
the superposition 10;) is drawn with probability pi. The 
behaviour of this mixed state is completely characterized 
by its density matrix p — J2iPi We will therefore 

identify a mixed state with its density matrix. 

The following properties of density matrices are imme- 
diate from the definition. For any density matrix p, 

1. p is Hermitian, i.e., p — 

2. p has unit trace, i.e., Tr(p) = P{h *) = 1- 

3. p is positive semi-definite, i.e., {ip\p\ip) > for 
all 

Thus, every density matrix is unitarily diagonalizable and 
has non-negative real eigenvalues that sum up to 1. The 
von Neumann entropy S{p) of a density matrix p is defined 
as S{p) — — AilogAi, where is the multiset of 
all the eigenvalues of p. In other words, S{p) is the Shan- 
non entropy of the distribution induced by the eigenvalues 



of p on the corresponding eigenvectors. For a comprehen- 
sive introduction to this concept and its properties, see, for 
instance, [|l5[ H |l3]] . 

The density matrix corresponding to a mixed state with 
superpositions drawn from a Hilbert space H is said to have 
support in H. First, we note the following. 

Fact 3.1 If p is a density matrix with support in a Hilbert 
space of dimension d, then S{p) < logd. 

This is because the probability distribution induced by the 
eigenvalues of p has support of size at most d. The Shannon 
entropy of any such distribution is at most log d. 

When a unitary operator U is applied to a mixed state, 
the corresponding density matrix p is transformed to UpW . 
Since the eigenvalues of UpU"^ are the same as those of p, 
we conclude that entropy is invariant under unitary opera- 
tions. 

Fact 3.2 For any density matrix p and unitary operator U, 
we have S{UpW) = S{p). 

On the other hand, when we make an orthogonal measure- 
ment on a mixed state, the the entropy of the system can 
only increase. I If a mixed state p is measured according to 
an orthogonal set of projections {Pj}, it is easily verified 
that the resulting density matrix is given by J^j Pj pPj ■ 

Fact 3.3 Let p be the density matrix of a mixed state in 
a Hilbert space Ti. and let the set of orthogonal projec- 
tions {Pj} define a measurement in TC. Further, let p' — 
PjpPj be the density matrix resulting from a measure- 
ment of the mixed state with respect to this observable. 
Then S{p') > S{p). 

It is not hard to see that this is in fact a consequence of the 
property of density matrices that the entropy of any random 
variable obtained by making a measurement on a mixed 
state is at least as much as the entropy of its density matrix. 
A proof of this property may be found in [12 Chapter 9, 
pp. 262-263]. 

3.2. Enhanced one-way quantum finite automata 

An enhanced one-way quantum finite automaton (QFA) 
is a theoretical model for a quantum computer with finite 
workspace. Models for such space-restricted quantum com- 
puters were first considered by [ pi] , ^. However, these 
models did not include the full range of operations allowed 

' This fact may appear to be counterintuitive at first, since tlie entropy 
of a system is usually understood to quantify our ignorance of the state 
of the system, and making a measurement reveals some information about 
its state. However, it should be noted that the increase in entropy is not 
claimed in the state of the system conditioned on the state of the observer, 
but in the state of the system with the state of the observer traced out. 



by the laws of quantum mechanics. In particular, the model 
of does not include measurements as intermediate steps 
in a computation, and the model of allows only measure- 
ments that check for acceptance, rejection or continuation. 
The model we describe below rectifies this situation by al- 
lowing any orthogonal measurement as a valid intermedi- 
ate computational step. Our model may be seen as a finite 
memory version of the mixed state quantum computers de- 
fined in Note that we do not allow the more general 
"positive operator valued measurements" because the im- 
plementation of such measurements involves the joint uni- 
tary evolution of the state of the automaton with a fresh 
set of ancilla qubits, which runs against the (fixed finite 
workspace) spirit of the model. 

In abstract terms, we may define an enhanced QFA as 
follows. It has a finite set of basis states Q, which consists 
of three parts: accepting states, rejecting states and non- 
halting states. The sets of accepting, rejecting and non- 
halting basis states are denoted by QaccQrcj and Qnon, 
respectively. One of the states, qq, is distinguished as the 
starting state. 

Inputs to a QFA are words over a finite alphabet S. 
We shall also use the symbols and '$' that do not be- 
long to E to denote the left and the right end-marker, re- 
spectively. The set r = S U {(|;,$} denotes the work- 
ing alphabet of the QFA. For each symbol G F, an en- 
hanced QFA has a corresponding "superoperator" Ua which 
is given by a composition of a finite sequence of uni- 
tary transformations and orthogonal measurements on the 
space C'^ . An enhanced QFA is thus defined by describ- 
ing Q, Q 

acc; Qrcj 7 Qnon-; 

qo, E, and Ua for all a eT. 
At any time, the state of a QFA can be described by a 
density matrix with support in C'^. The computation starts 
in the state \qQ){qo\- Then transformations corresponding 
to the left end marker '(];,' the letters of the input word x 
and the right end marker '$' are applied in succession to 
the state of the automaton, unless a transformation results 
in acceptance or rejection of the input. A transformation 
corresponding to a symbol a E T consists of two steps: 

1. First, Ua is applied to p, the current state of the au- 
tomaton, to obtain the new state p'. 

2. Then, p' is measured with respect to the observ- 



able Ei,cc 



)E,.cj®Enon, where i^acc = span{|g) | q e 
= span{|g) | q e Qrcj}, Enon = 



rcj 



span{|g) | q g Qnon}- The probability of observ- 
ing Ei is equal to Tr(Pip'), where Pi is the orthogonal 
projection onto Ei. If we observe i?acc (or E^rcj), the 
input is accepted (or rejected). Otherwise, the compu- 
tation continues (with the state PnonP' Pnon), and the 
next transformation, if any, is applied. 

We regard these two steps together as reading the symbol cr. 
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Figure 1. A stream of random bits determining the 
evolution of a quantum system. 



A QFA M is said to accept (or recognize) a language L 
with probability p > -i if it accepts every word in L with 
probability at least p, and rejects every word not in L with 
probability at least p. 

The size of a finite automaton is defined as the number of 
(basis) states in it. The "space used by the automaton" refers 
to the number of (qu)bits required to represent an arbitrary 
automaton state. 

The model of QFA as defined in |^] differs from this 
model in that the superoperators Ua are all required to be 
given by unitary transformations Ua . 

4. The automata and coding lower bounds 

In this section, we prove the first three theorems of Sec- 
tion 0. They are all based on a common framework which 



we present in Section X. 1 



4.1. The conceptual framework 

Consider the evolution of the a quantum system under 
a random sequence of unitary transformations {Vi), where 
each Vi is either Uq or Ui (see Figure 1). Now suppose 
that the transformations Uq and Ui are distinguishable in 
the sense that for every superposition |0) of the system, 
Uq \(t>) can be distinguished from Ui\(f)) with success prob- 
ability, say, 2/3 by some fixed measurement. At each step, 
the system gains some information about the transforma- 
tion applied to it, and we expect the entropy of the system 
to increase accordingly. In general, we could apply one of 
two arbitrary but distinguishable quantum operations on the 
system, and we would expect the same increase in entropy. 
This is the essential content of our key lemma: 

Lemma 4.1 Let cro and ct\ be two density matrices, and 
let cr = + Ci) be a random mixture of these matri- 

ces. If O is a measurement with outcome or 1 such that 



making the measurement on cTf, yields the bit b with average 
probability p, then 

S{rr) > ^[S{ao) + S{a,)] + il-H{p)). 

This lemma is a simple corollary of the classic Holevo the- 
orem ^ from quantum information theory which bounds 
the amount of information we can extract from a quantum 
encoding of classical bits. 

Theorem 4.2 (Holevo) Let x t-^ cTx be any quantum en- 
coding of bit strings, let X be a random variable with a dis- 
tribution given by Pr [X = x] = Px, and let a = Px<^x 
be the state corresponding to the encoding of the random 
variable X.IfY is any random variable obtained by per- 
forming a measurement on the encoding, then 

I{X:Y) < S{(t) -Y,P.S{(Tx). 



Proof of Lemma 4.1 

the bit b. 



Consider crt, to be an encoding of 
If X is an unbiased boolean random variable, 
then cr represents the encoding of X. Let Y be the outcome 
of the measurement of this encoding according to O. By the 
hypothesis of the lemma, Pr [Y = X] = p. It is easy to see 
from the concavity of the entropy function that 

IiX:Y) > l-H{p) 

(cf. Fano's inequality [^). The lemma now follows from 
Theorem \.2. ■ 



4.2. The case of quantum automata 



We now prove Theorem 2.1 using this framework. The 



first two parts of the theorem are easy; we turn to part 3. We 
need the following definition from [^j] . 

Definition 4.1 An r-restricted one-way QFA for a lan- 
guage L is a one-way QFA that recognizes the language 
with probability p > i, and which halts with non-zero prob- 
ability before seeing the right end-marker only after it has 
read r letters of the input. 

We first prove a bound of 2'^^~^(p^)" for the number of 
basis states in any n-restricted QFA M for L„. Note that 
the evolution of M on reading stream of random bits cor- 
responds exactly to that of the quantum system described 
in Section 4.1 during the first n steps. So, at the end of 
reading a random n-bit string, the state of M has entropy 
at least (1 — H{p) )n. However, this entropy is bounded 
by log IQI by Fact |3.l| above, where Q is the set of basis 
states of M. This gives us the above bound. Since we will 
refer to this argument later, we formalize it below. 

Let pj, be the state of the QFA M after the fcth symbol 
of a random n-bit input has been read (0 < fc < n). 



Claim 4.3 S{pk) > (1 - H{p))k. 

Proof: Let Ua be the unitary operator of M corresponding 
to the symbol cr. Let Eq be the span of the accepting basis 
states of M and let Ei be the subspace orthogonal to it. De- 
fine the measurement O as applying the transformation U$ 
(recall that '$' is the right end-marker) and then measuring 
with respect to the observable Eq ® Ei. We can now prove 
the claim by induction. 

For fc = 0, the state of the automaton is pure, so S{pq) = 
0. Now assume that S{p,^_i) > (1 - H{p)){k - 1). After 
the fcth random input symbol is read, the state of M be- 
comes 



Pk 



\{UoPk- 



t 



UiPk-,ul). 



By the definition of M, measuring U^Pk^iUl according 
to O yields b with probability at least p > i. So by 



Lemma 4. 1 , we have 



S{Pk) 



> J E ^(^fc/'^-l^b) + (l-^(^^))■ 



(l) 



6=0,1 



But the entropy of a mixed state is preserved by unitary 
transformations (Fact 3.2), so 



S{U,p,^,ul) = S{p,_,)>{l-H{p))(k-1). 

Inequality (jl]) now gives us the claimed bound. ■ 

To pass from a bound on restricted QFA to one for gen- 
eral QFA for the language, we now invoke the following 
lemma from [|| . 

Lemma 4.4 Let M be a one-way QFA with \Q\ states rec- 
ognizing a language L with probability p. Then there is 
an r-restricted one-way QFA M' with 0(r \Q\) states that 
recognizes L with probability p. 

Thus, any general QFA for L„ using \Q\ basis states yields 
an n-restricted QFA that uses 0{n \Q\) states. By the lower 
bound derived above, we then have 

IQI > 2(i--f^(p))"-i°s"-o(i)^ 



the bound stated in Theorem 2.1 



4.3. Robustness of the automata lower bound 

As mentioned in Section |lj QFA in which general in- 
termediate measurements are allowed (which we call en- 
hanced QFA), were suggested as a way of overcoming the 
restriction of reversible evolution that leads to the exponen- 
tial lower bound shown in ||^ (and in the previous section). 
Theorem 2.2 rules out this possibility. We prove this next. 

Armed with the formalism of density matrices, it is not 
hard to verify (by using a technique of which is also 



used in 

guages. Moreover, the lower bound of Theorem |2.1 



) that enhanced QFA accept only regular lan- 

con- 



tinues to hold for such QFA, as we show below. This essen- 
tially follows from the fact that the entropy of a quantum 
system cannot decrease under the action of a sequence of 
unitary operations and orthogonal measurements. 



We now sketch how the proof of Theorem 2.2 may be 
completed. We proceed as in the previous section by first 
showing the bound for restricted enhanced QFA, which are 
defined analogously. Lemma 4.4 , which extends easily to 



enhanced QFA, then gives us the claimed bound. 

As before, we consider the state of a restricted au- 
tomaton for Ln with acceptance probability p after a ran- 
dom n-bit input has been read. Its entropy is bounded 
by log |Q|, where Q is the set of its basis states. Follow- 
ing Lemma 43 , we argue that the entropy of the automaton 
state increases by at least 1 — H{p) every time a new ran- 
dom input symbol is read. Claim extends easily to this 
case as well: initially, S{pq) > 0, and we need only prove 
that S{Ui,Pf^_i) > S{pf^_i) for 6 = 0, 1, where Ub is the 
superoperator corresponding to the bit h, and p^ is the den- 
sity matrix of the automaton state after i input symbols have 
been read. Since Ub is the composition of a finite sequence 
of unitary operators and orthogonal measurements, this is 
immediate from the monotonicity property of density ma- 
trices implied by Facts 3.2 and 3.3. 

As a simple consequence, we obtain: 

Theorem 4.5 The regular language {0, 1}*0 cannot be ac- 
cepted by any enhanced QFA with probability bounded 
away from i. 

To see this, we note that any enhanced QFA that supposedly 
recognizes this language also correctly recognizes all words 
of length at most n of the language L„, for every n. The 



proof of Theorem 2.2 now tells us that the number of states 
in the QFA is 2^'"-' for every n, which is a contradiction. 

4.4. Random access codes 



We now prove Theorem 2.3. Consider any random ac- 
cess encoding with parameters n, m,p. Let p^ denote the 
density matrix corresponding to the encoding of the n-bit 
string X. The density matrix of a random codeword is given 
by p = ^ J2x Px- We can bound the entropy of p by m 
by Fact 3.1 Using Lemma 4.1, we can also prove a lower 



bound for the entropy of p, and hence obtain a lower bound 
on m. 

For any y £ {0, l}'^, where < fc < n, let 



1 



E 



Claim 4.6 S{py) > (1 - H{p)){n - k). 

Proof: The proof is by downward induction on k. The 
base case fc = n is satisfied easily: S{py) > for all n-bit 
strings y. 

Suppose the claim is true for k + 1. We have 



Py = ^{Poy+Ply) 



By hypothesis. 



2n-fe jL^ Pzy 

ZG{0,1}""* 



S{p,y) > (l-ff(p))(n-fc-l), 

for 6 = 0, 1. Moreover, since the two density matrices are 
mixtures arising from strings that differ in the (n — fc)th bit, 
the measurement On-k distinguishes them correctly with 
probability p. Thus, by Lemma p~l| we get 

SiPy) > liSipoy) + S{p,y)) + il-Hip)), 

which gives us the claimed bound. ■ 

Theorem now follows by combining the claim 
(with y chosen to be the empty string) and the upper bound 
of m on the entropy. Notice that we could allow the mea- 
surement Oi to depend on the subsequent bits of the en- 
coded string in the argument above. This means that the 
bound holds for serial codes as well. 

We conclude this section by observing that the bound of 
Theorem 2.3 also gives a communication lower bound for 
the problem of information-theoretically secure private in- 
formation retrieval with one database (see, e.g., The 
problem may be described as the following communica- 
tion game. One party, Alice, has as input an n-bit string x 
(the database) and the second party. Bob, has an index i G 
{1, . . . , n}. Bob wishes to learn the value of the ith entry 
in the database Xi (with probability p > ^) without reveal- 
ing any information about i to Alice. The privacy condition 
fi-anslates to the fact that in any (quantum) protocol for this 
problem, Bob's computation and communication are inde- 
pendent of his input. We may also assume (by the princi- 
ple of safe storage) that no intermediate measurements are 
made during the quantum protocol. A lemma due to [ [l0[ ] 
(based on a technique from |16|) tells us that whenever 
Bob's actions in a protocol are oblivious to his input, his 
state lies in a fixed subspace of dimension 2™ independent 
of Alice's input, if rn qubits were exchanged during the pro- 
tocol. Since his state at the end of an information retrieval 
protocol is independent of i, Bob may extract any bit xj 
from the state by making a suitable measurement. Thus, 
an rn-qubit protocol defines a random access code over m 
qubits, which implies that m > (1 — H{p))n. 



We claim that 



5. An alternative to Holevo's theorem 



Claims.! E Jl f < 2' 



In this section, we prove Theorem 2.4. We first prove the 
lower bound on the decoding probability. 

Consider random variables X and Y as in the statement 
of Theorem 2.4. We describe a natural decoding proce- 



dure Vo and then show that it satisfies the requirement of the 
theorem. On input y, the decoding algorithm outputs x such 
that p^^y = ma.Xx' Px'\y, where = Vr[X = x\Y = y\. 
Let p^y'^^ denote this probability and let Xy denote the cor- 
responding X. 

Claim 5.1 The procedure Vq described above decodes cor- 
rectly with probability at least 2~^(^l^). 

Proof: The probability of correct decoding is equal to 

Vr[V^{Y) = X] 



y 



Now, H{X\Y = y) = -'}lxPx\y^'^ZPx\y > -logp; 
So p™'''' > 2-"^^\'^=y\ Taking expectation over Y, and 
noting that 2^^ Ms a convex function, we have 



max 
V 



E > 



E 



2-H(X\Y=y) 

> 2-nH{X\Y=y)] 
^ 2-H(X\Y) 



which gives us the claimed lower bound on the decoding 
probability. ■ 

We now turn to the upper bound on the probability of 
correct decoding. Consider any encoding of strings x ^ 
{(lx,ii \4>x.i)} into mixed states over m qubits, and any de- 
coding procedure V. The output of V may be viewed as 
the outcome of a measurement given by orthogonal projec- 
tions {Px} in the Hilbert space of the encoding augmented 
with some ancilla. The probability may then be bounded as 



Pr[2?(y) = X] 

= 5]Pr[p(y) 

X 



x\ 



Vy[X 



- ^Px^qx.i\\Px\(t>x,t)\Y 

X i 



(2) 



where px = Pi[X = x], and \(f)x) is the pure state \(j)x.i) 
that maximizes the probability || Px {(px.i) 11^ of obtaining 
the correct outcome x when its encoding is measured. (In all 
the expressions in this section, the ancilla qubits used in the 
measurement have been suppressed for ease of notation.) 
We can now bound the decoding probability by using the 
following claim. 



Proof: Let E be the subspace spanned by the code- 
words \(t)x), and let Q be the projection onto E. Since 
the codes are over rn qubits, E has dimension at most 2™. 
Let {|ei}} be an orthonormal basis for E. Let {|ea;.j}} be 
an orthonormal basis for the range of P^. The union of all 
these bases {\ex.j)} is an orthonormal basis for the entire 
decoding Hilbert space. Now, 

3 
3 

The last inequality follows because the length of the pro- 
jection of any vector onto a space W is at least the length 
of its projection onto a subspace V of W. Observe 

that||Q|e,j)f =E»l(e^|ex,,)|'. So, 

X i x,j 

< Ell -^11' 

i 

< 2™, 

since the orthonormal basis {|ei)} for E has size at 
most 2™, which is a bound on the dimension of E. ■ 

By (||), the probability of correct decoding is at 
most J2xPx II Px I'Px) 11^- From the claim above, this ex- 
pression is equal to J2xPx'^x, where < \x < 1 
and J2x "^x < 2™. The maximum over all such {A^;} of 
this quantity may easily be seen to be bounded by the sum 
of the 2™ largest probability masses px, i.e., by P{X, 2'"). 
Moreover, for any given X and m, there is a natural pair of 
encoding and decoding functions that achieves this bound. 
This shows that the bound is tight. 

The above bound on decoding probability can give us 
sharper bounds on the number of qubits used in an encoding 
than an application of Holevo's theorem. We illustrate this 
with an example encoding of n-bits into n + 1 orthogonal 
states \i). Half the strings are encoded as |0), a fourth as 
an eighth as |2), and so on. A random codeword from this 
code can be decoded with probability exactly {n + 1)2^", 
which yields the correct answer for the number of qubits 
used by invoking our bound. On the other hand, the mutual 
information I{X : Y) between the encoded string and its 
decoding is 



22 + 23 



n 
2" 



n 

2"' 



which sums up to 2 — 2 ^\ This gives us a lower bound 
of at most 2 when combined with Holevo's theorem. 



Note that Theorem |2.4| may be appUed in a communica- 
tion complexity context as well, when combined with the 
lemma due to [|l6[ [l^] mentioned in Section 4.4 . This im- 
plies that if after the exchange of m quantum bits, n classi- 
cal bits are transferred with success probability at least 6 > 
0, then m > n — log j-. An application of Holevo's the- 
orem along with Fano's inequality [0] would result in the 
boundm > 6n — H{S). This lower bound is a crucial ingre- 
dient in proving the quantum communication complexity of 
the inner product function 1 10|. Our result gives a bound 
similar to that shown in for computing Inner Product, 
but does not seem to generalize to the case of entanglement 
assisted communication considered in [^]. 
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